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Abstract — This paper studies the design of codes for distributed 
storage systems (DSS) that enable local repair in the event of 
node failure. This paper presents locally repairable codes based 
on low degree multivariate polynomials. Its code construction 
mechanism extends work on Noisy Interpolating Set by Dvir et 
al. fl). The paper presents two classes of codes that allow node 
repair to be performed by contacting 2 and 3 surviving nodes 
respectively. It further shows that both classes are good in terms 
of their rate and minimum distance, and allow their rate to be 
bartered for greater flexibility in the repair process. 

Index Terms — Distributed storage systems, locally repairable 
codes, punctured Reed-Muller codes. 



I. Introduction 

The importance of 'cloud' storage has resulted in a growing 
body of work in both theoretical analysis and practical designs 
for efficient distributed storage systems. Conventionally, re- 
silience in distributed storage is obtained by simple replication 
of data; however, such replication can be highly inefficient 
in terms of the number of nodes required for this resilience. 
Thus, coding has come to play a central role in designs 
for resilient distributed storage systems (DSS). In particular, 
coding schemes for DSS must enable efficient system repair 
in the event of (a small number of) node failures 0. In 
|2), Dimakis et al., the authors consider the total amount 
of data downloaded during single node repair, i.e., repair 
bandwidth, as an important metric to gauge the efficiency of 
any coding scheme employed in DSS and presents an lower 
bound on repair bandwidth. Since then, multiple codes has 
been proposed that achieve this lower bound. 

In general, there are multiple, possibly apposing, metrics 
using which the performance of a DSS can be characterized 
such as security, locality, load-balancing and privacy; and the 
metric of interest in this paper is locality fl3], J4], 0. The 
goal of this line of research is to design coding mechanisms 
for DSS that enable node repair to be accomplished while 
requiring contact with only a small number of surviving nodes 
in the system. In Q, Gopalan et al. establish an upper bound 
analogous to the singleton bound on the minimum distance 
of locally repairable codes and show that pyramid codes J6] 
achieve this bound. Subsequently, the work in Prakash et 
al. extends the bound to more general definition of locally 
repairable codes Q- 

In this paper, our goal is to generalize & extend the existing 
literature on locality in repair (and decoding) in DSS |@], 
G), @. Our coding scheme builds on schemes studied in the 
domain of locally decodable codes (LDC) |8|. Specifically, 
our coding scheme employs a punctured Reed-Muller (RM) 
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code based on low-degree multivariate polynomials to store 
data. Our reason for choosing RM codes based on low- 
degree multivariate polynomials is the inherent locality of 
its codewords. Indeed, it is well known in LDC literature 
that RM codes generated using low-degree polynomials afford 
locality, at the cost of being low rate. In other words, the 
extent of redundancy required in these codes is comparatively 
much higher than a code without locality properties. This 
has rendered an RM code based LDC unattractive, as the 
advantage provided by locality is superseded by the large 
storage space requirement of these codes. 

In this paper, our approach is one of judiciously puncturing 
RM codes in order to obtain 'good' rates for the resulting 
codes while still retaining the locality property for repair. A 
naive approach to puncturing RM codes compounds problems, 
as one may lose aspects of algebraic structure that make RM 
so desirable, including loss in structured decoding strategies as 
well as locality. Keeping this in mind, we turn to a methodical 
approach for puncturing of RM codes as introduced in JT] . In 
(fl~), Dvir et al. develop an algorithm for puncturing RM codes 
based on low-degree polynomials which results in 'good' 
codes, i.e., codes with constant rate and constant relative dis- 
tance (in block length). Moreover, 0] also presents an efficient 
decoding algorithm for these punctured codes. However, JT] 
does not address locality properties of the resulting punctured 
RM code. In this paper, we show that a modified version 
of the punctured RM codes as studied in JJJ exist that are 
simultaneously 'good' from all three perspectives - rate (extent 
of storage), distance (resilience) and locality for repair of DSS. 

The remainder of this paper is organized as follows. In 
Sec. IUJ we provide a brief introduction to generalized RM 
codes from polynomial evaluation perspective with their prop- 
erties relevant to this paper. In Sec. |TTT] we define the notion 
of local repair and characterize the locality afforded by RM 
codes. In SeclIVIandlVl we present two closely related coding 
schemes for DSS, which enable local repair based on 2 and 3 
nodes respectively. 



II. Background: RM codes 

A generalized RM code lZM q (u,m) is defined with the 
help of irreducible polynomials from ¥ q [xi,. . . , x m ] of degree 
at most u. Here, F q [xi, . . . , x m ] denotes the ring of m-variate 
polynomials over field ¥ q , and a polynomial in this ring is 
called irreducible if its degree in each variable is less than 
q — 1. Throughout the paper, we assume ¥ q to be a prime 
field. Each irreducible polynomial of degree at most u gives a 
<7™ l -length codeword in lZA4 q (u,m) when this polynomial is 
evaluated at all q m points in F™. Thus, TZAi q (u,m) can be 
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defined as follows: 

TZM q (u,m) 

= {eval(/),?m G Ff :/ E ¥ q [x u . . . , x m ] & deg(/) < 

(1) 

where eval(f)w™ = (f(ai),...,f(a q ™)) denotes evalua- 
tions of the polynomial / at all q m points in F™. The dual code 
of 1ZAd q (u, m) is also an RM code and is generated by irre- 
ducible polynomials of degree at most u = (q — l)m — u — 1, 
i.e., 

(IZMqiu,™)) 1 - = TZM q (u,m) = KM q ((q-l)m-u-l,m). 
Minimum distance of lZA4 q (u, to) is given by the following 

d mm {KM q {u,m)) = (q- O)^^ 1 (2) 

where u = u(q — 1) + 6 with < 6 < q — 1. Moreover, 
Delsarte et al. also characterize the codewords of minimum 
weight for this code J5). These minimum weight codewords 
are associated with the polynomials of following form in 

Fq j • ■ • j ^m] ■ 

/(x) = wontx (i - (4(x) - c^) 9 " 1 ) nf =1 (Ui (x) - 25,-) 

(3) 

where {u>j}j =1 are distinct elements from ¥ q , and {w}f =0 are 
arbitrary elements from F g with luq ^ 0. Here, {^iQIfi 
represent /i + 1 linearly independent linear forms (functions) 
on F™. 

III. On Locality Properties of Codewords 

In this section, we illustrate the desired locality property 
in a codeword of a generalized RM code. First, we formally 
define the notion of locality of an encoded symbol: 

Definition 1: A particular symbol in a codeword has local- 
ity r if it can be recovered by accessing encoded symbols from 
only r other positions, i.e., it is uniquely defined by a set of 
r encoded symbols in a codeword. 

For a particular encoded symbol, possessing a locality of r 
is equivalent to having a codeword of support at most r + 1 
in the dual code such that the support of this codeword in 
dual code contains index of encoded symbol in interest. Since 
our object of study is the code lZM q (u, to), we focus on the 
codeword of minimum support in its dual code. For RM codes 
based on polynomials of degree at most u < q — 2, we know 
that the following holds for TZM q {u, m) = HM q (m(q—l)- 
u — 1, to): 

U x = (m - l)(q -l) + (q-u-2)=fi(q-l) + e. 

Thus, it follows from (O that d m i n (lZM q (u, to)) is u + 2. 
Next, we present a Lemma that establishes a necessary 
condition on the support of minimum weight codewords of 
KMq(u,m). 

Lemma 1: For every minimum weight codeword of 
TZAdg (u, m), u + 2 points corresponding to its support lie on 
on a line in F™. In other words, all u+2 points (pi , . . . , Pu+2) 
are of the form (pi, Pi +iih, . . . , pi +£ u +ih), where {ti}™={ 
are distinct and nonzero elements of ¥ q and h represents the 
direction of the line. 



Proof: Let c be a minimum weight codeword of 
TZM.^(u, q). From (O, c is obtained as evaluations of a 
polynomial of the following form at all points of F™: 

/(x) = (1 - &(x) - ui)"- 1 ) LTj-r 2 (Ux) - oJ,-) 

where are m linearly independent linear forms 

(functions) and {Wj}*~" -2 are distinct elements in ¥ q . Note 
that each linear form £{(■) can be represented by a vector 
L t G F™ containing the coefficient of £i(-). Let L be a 
to x m matrix which has Li as its i th row. Since £;(•) are 
linearly independent, L is a full-rank matrix. For u + 2 points 
(pi, . . . , p u +2) corresponding to the support of c, we have 

L Pi = il, (4) 

where = (u>i,lo2, ■ ■ ■ , w m _i,2i) T }?i . Now assume that 
not all u + 2 points lie on a line, i.e., without loss of generality 
there exist three points in the set of u + 2 points {pi, P2, P3} 
such that 

P2 = Pi + *ih and p 3 = pi + t 2 g 
where g ^ rh for any r G F g \{0}. From (|4]i we have 

Lh=fo,...,o,^Wfi h 

L g = (0,... ,0,^1) = n s . 

Note that it is possible to find (ai,a 2 ) 7^ (0,0) G F 2 , such 
that ai h h + a 2 n s = G F™. Thus, it follows from ©, 

L(aih + a2g) = ai^h + «2^g = 0. 

This, however, contradicts the full-rank nature of L as «ih + 
«2g is a nonzero vector and a full-rank matrix must have a 
trivial null space. So, all u + 2 points corresponding to support 
of a minimum weight codeword c must lie on a line. ■ 
In fact, an even stronger result holds in the sense that, 
given any u + 2 points on a line in F™, a minimum 
weight codeword of TZM. q (u, m) exists that is supported 
on these u + 2 points. Next, we illustrate a procedure for 
determining a polynomial in ¥ q [x\, . . . , x m ] with degree at 
most m(q — 1) — u — 1 that corresponds to a codeword in 
TZM. q (u, m) supported on a particular set of u + 2 points 
on a line (pi, . . . ,p M+2 ) = (pi,P2 +*ih...,pi + t u+1 h); 
where {ii}"^ 1 are nonzero distinct elements of ¥ q . First, pick 
a vector v = (0, . . . , 0, v m ) T G F™ which is nonzero only 
at the m th coordinate. Second, construct an invertible matrix 

F mxm such that 

Mv = iih, 

i.e., the last column of M is tih. Finally, define a polynomial 
<?(•) of degree m(q — 1) — q — 1 = (m—l)(q — l) + (q — u — 2), 

5 (x) = looH^ 1 (1 - (4(x) - UiY- 1 ) n«;r 2 (^™(x) - Ui) 

where linear functions £,(•) are defined by the i th row Li of 

L = M- 1 G F™ xm and (w u . . . , w m _i, Q) T = Lp u and 
{ujj} q jZi~ 2 ^ distinct elements in ¥ q \{uj,uj + |^i> m ,w + 

It follows from Lemma [T] and the construction mecha- 
nism described above that a minimum weight codeword of 
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1ZA4 m (u, q) can only be supported on u + 2 points on a line; 
and given u + 2 points on line, there exist multiple minimum 
weight codewords in 1ZA4 m (u, q) supported on these points. 
Note that there are multiple choices for the matrix M in 
©, and each choice of M results in a different codeword 
of TLM q (u, m) supported on u + 2 points. 

IV. Punctured RM Codes with Locality 2 

The stage is now set for a general method of designing 
coding schemes for DSS with locality 2 based on lZA4 q (l, m). 
Let G be a generator matrix of an [TV, m, 2eN+l] q linear code. 
We construct a set Si = {ai, . . . , ajy} C F™, where {a,}^ 2 
are N rows of G. For each pair £ [N] 2 such that i < j, 
we define a family of sets {Bjj}i<j sucn that 

B UJ = a, + t((q - l)a. t + a,-), 2 < t < 1 + L, 

where L is a design parameter of choice. Now, we construct 
a set Sx as follows 

S I = S 1 \jl |J B t A , 

\(i,j)ez / 

where I C [iV] 2 such that (i, j) <E I only if i < j. 

A. Encoding Data 

Let the file to be encoded be T. We first divide the file T 
into K = m symbols b = (pi,..., b m ) £ F™. Given the data 
vector b, we construct a polynomial / b (-) £ F g [a;i, . . . ,x m ] 
of degree at most 1 as 

f b (xi, x m ) = bjXj. (6) 
i=i 

The data vector b is encoded to a vector = 

(/ b (Pi), ■ • ■ ,/ b (P|5 x |)) where {Piliii are distinct points 
of Sx in any prespecified order. Each symbol in is now 
stored on a different node in DSS. Let C x denote the codebook 
obtained using the aforementioned encoding scheme. Note that 

C X C (nM q (l,m)) Sx , (7) 

where (1ZM. q (l, m)) s denotes the codebook obtained by 
puncturing a code lZA4 q (l,m) on 5f . 

Next, we show that the code as detailed above is well 
defined in the sense that the dimension of the code is K. 
Let y = (/(ai), . . . , /(ajy)) T denote the vector containing 
evaluations of / b (-) on Si = {ai, . . . ja^r} C Sx- It follows 
from © that 

y = Gb (8) 

where matrix G is the generator matrix of [N, m, 2eN + l] q 
linear code used to construct Si. Thus, b can be decoded 
from y using a decoding algorithm corresponding to this 
[N,m,2eN + l] q code. This implies that the dimension of 
the proposed code is also K. 



B. Local Node Repair 

In this subsection, we describe a procedure for 2— local 
repair of a DSS based on our code. For each i £ [N], we 
define TZ(i) = {j £ [N] : (i,j) or (j, i) £ I}. We assume that 

1 is such that 

\K(i)\> 1 for all i £ [N] (9) 

i.e., for each i £ [N] we add at least one set Bjj or 
Bj t i while generating Sx- Without loss of generality, a node 
corresponding a point pi £ Sx fails. It follows from (O that 
there exists a set of L+2 points {pi, pi+h, . . . , pi + (L+l)h} 
in Sx- For example, if node corresponding to ai fails and 
(1,2) £ X, then L + 2 points {ai,a2 = ai + (q — l)ai + 
a 2 , . . . ,ai + (L + l)((q - I) an + a 2 )} are in Sx. In this 
example, we have pi = ai and h = (q — l)ai + a 2 . Note that 
these L + 2 points lie on a line, which has a direction h and 
passes through pi. For L > 1, we obtain L + 2 > 3 points 
(including pi) on a line in F™. Moreover, (O implies that 

(C 1 ) 1 - d (TZMf(l,m)) \ s \ 

where (jZM^(l,m)j \ Sx denotes the shortened code of 

TZM. q (l,m) corresponding to set Sx C F^ . We know 
from Lemma Q] and the discussion following it that there 
exist a codeword of 1ZM q (1, m), which is supported on 
these 3 points on a line. Moreover, this codeword is part of 
the shortened code (TZM. q L (l,m)^ \ Sx . Therefore, using this 
codeword in the dual code, we can recover the failed node's 
symbol by accessing encoded symbols corresponding to two 
other points on the line from two other storage nodes. This 
establishes 2-locality, and therefore local repairability of our 
coding scheme. 

In terms of a traditional LDC understanding, the node 
repair process can be viewed as polynomial interpolation using 
at least 2 out of the remaining L + 1 points (excluding 
the point pi associated with the failed node) on the line 
{pi + h, . . . , pi + (L + l)h}. Consider g(t) = / b (p + ih), a 
polynomial over t of degree at most 1, Given its evaluation at 

2 points, {pi + t^h, p + <i 2 h}, we can uniquely recover g(t) 
using any standard polynomial interpolation method. Now the 
desired symbol / b (pi) can be recovered by evaluating g(t) 
at t = 0. 

Remark 1: Note that, once we know the polynomial git), 
we can recover encoded symbols associated with all L + 2 
points on the line defined by the pair (pi,h). This property 
can be used for cooperative node repair in order to reduce 
repair bandwidth by determining a particular line such that 
it comprises of less than L failures and then recovering all 
failures on the line simultaneously. 

C. Code Parameters 

The rate and minimum distance of the proposed coding 
scheme depends on three design parameters, I, L, and 
[N,m,2eN + l] q linear code. In what follows, we pick an 
[N, m, N — m + l] q maximum distance separable (MDS) code 
for [N, m, 2eN + l] q code and analyze two cases: 
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1) Case 1: In this case, we consider! = {(1, 2), . . . , (i, i + 
1),...,(N — (assuming that N is even). Here, we 
have |<Sj|< N + ^L, which results in the rate of the code 
being greater than n J"n l = 0(x)- ^ quick calculation that 
combines locality with (© shows that the proposed code is 
resilient against any N — to + L node failures. Therefore we 
have, 

d min (C x ) > N-m + L + 1 (10) 

It also follows from (O that we can modify our code to 
be a systematic code by picking any set of m rows of G 
(say {ai, . . . , a m }) to be an identity matrix, without affect- 
ing local repairability of the code. Now modify X to be 
{(1, 2), . . . , (m — 1, to)} (assuming that m is even). The upper 
bound on d m j n (C x ) established in Q is applicable in this case. 
For 6 = L + 1 and r = 2 this bound results in: 

d min (C x ) < N+^L-m-(!j-l)L + l 
= N-m + L+1, 

which, along with J10I . proves the optimality of our codes, 
given that locality for information symbols is to be ensured. 
Note that this code is essentially a Pyramid code as presented 
in 0, 0. 

2) Case 2: Next we consider X = € [-/V] 2 : i < j}, 
i.e., \I\= (^). In this case, our rate becomes 

mte{C x ) = « ^— = 9 ( — ). (11) 

' \Sx\ N+{ N 2 )L \mLj 

The lower bound given in dTob holds for this case as well. 
For this choice of X, many points of Si have multiple lines 
passing through them in Si. Thus, it is more likely to be 
able to combine the node failures in groups along a particular 
line; then performing repair simultaneously for all of them by 
contacting just 2 nodes. 

Remark 2: It is evident from previous two cases that set X 
enable us to trade-off rate of the code for flexibility in node 
repair and data access. 

V. Punctured RM Codes with Locality 3 

In this section, we generalize the method of designing codes 
with locality 2 from the previous section to obtain coding 
schemes that are 3-local repairable. As opposed to 2-local 
repairable codes, codes designed in this section are based 
on polynomials of degree at most 2 in ¥ q [xi, . . . , x m ] and 
therefore related to HA4 q (2, m). 

Let Si be the set {ai, ...,ajv} Q FJ? as defined in 
Sec.|lV]with respect to an [N, to, 2eN+l] q linear code. In this 
section, we also require that the maximal hamming weight of 
a codeword in this [N, m, 2eN + l] q linear code is less than 
(1 — 2e)N and minimum distance of its dual code is at least 5. 
We define another set S2 = Si + Si C W* . The requirement 
on minimum distance of dual code implies that Si satisfies 
condition *fl F° r each pair <E [N] 2 with i < j, we 

'For definition of condition *2 and its importance for correctness of 
algorithm AI (denned in Sec. IV-Al . readers may refer to JTJ. 



construct a family of sets {Ai t j}i<j similar to {-Bj,j}j<j in 
Sec. UV] such that 

A i}j = 2a 4 + t((q - l)a. ( + a,-), 3 < t < 2 + L, 

where L is again a design parameter. Now, we generate a set 
Si as follows: 

Sx = S 2 \j[ IJ A t A, (12) 
where X is as defined in Sec. HVl 

A. Encoding data 

Given a file T to be encoded, we divide it into K = ( m ^ 2 ) 
symbols b = [bi, . . . , bx] £ . The data vector b is used to 
construct a polynomial / b (x) e ¥ q [x\, . . . ,x m ] as follows 

f h (xi,...,x m ) = ^fciX^M (13) 

where M is the index set for lexicographically arranged 
irreducible monomials of degree at most 2 in F g [xi, . . . , x m ] 
and x a W — x" ...Xm' n is i th monomial in Ad, 
which is uniquely defined by its exponent vector a(i) = 
[a(i, 1), . . . , a(i, m)}. In order to get codeword correspond- 
ing to data vector b, we evaluate the polynomial / b (x) at 
all points in Si. It follows from ((TJ that we have C 1 = 
(1ZAA q (2, m)) s , where C x denotes the codebook that we get 
from aforementioned encoding procedure. 

In order to show that C x is well defined, i.e., its dimension is 
K, we can potentially utilize an approach similar to that used 
in Sec. [IV] and show that a sub-matrix of the generator matrix 
of C 1 is full rank. However, we follow a different approach 
in which we show that a polynomial interpolation algorithm 
recovers the data polynomial / b (x), therefore the original data 
vector b, from the evaluations of / b (x) on 5*2 C Si. The 
interpolation algorithm is due to Dvir et al. 0], and plays an 
important role in establishing a lower bound on d m i n (C x ) in 
Sec. IV-CI We present an outline of the algorithm in context of 
recovering a polynomial of degree at most 2. Interested readers 
may refer to JT] for complete algorithm and its analysis. 

Interpolation Algorithm Al [Q: For a polynomial 
/(x), we define its partial derivate vector A/(x) = 

^J^-(x), . . . , ^-(x)J and directional derivate in the direc- 
tion of a e F™ 

3/(x,a)=^a 4 -— (x) 

i—l 

It follows from Lemma 2.1 in JTJ that for any a, b £ F™ and 
the polynomial of interest / b (-) of degree at most 2, 

/ b (x + a) - / b (x + b) = 9 /? (x, a - b) + E (14) 

where E is a constant and <9 f b(x) is a degree 1 polynomial 
which represents directional derivative of / 2 (x), homoge- 
neous part of / b (x) with degree 2. Given evaluations of / b (x) 
on S2 the algorithm works as follows: 
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Step 1: Define 

T(i) = Si + a, = (ai + &i, . . . , a N + a t ) 

and 

c£(0 - (c£) rW = (/ b ( ai + at), f h (a N + a,)) 
Note that it follows from (O that c^(i) - c^(j) = (/ b (ai + 



a,)-/ b (ai+aj 



, f h (a N + a. t ) - / b (a N + a 3 ■) ) represents 



evaluations of gij(x) = / b (x+ai) — / b (x+aj), a polynomial 
of degree at most 1, on Si. Thus, using decoding algorithm 
for [N, m, 2eN + l] q code, we can recover <7ij(x) as proved 
in part 2 of Lemma 2.3 in (T). Removing the constant term 
within g%,j(x) results in 9fb(x, a* — a.j). 

Step 2: The algorithm takes all (^) {<9/|>( x i a i — a j)}i<j 
and recover A^b(x). As a homogeneous polynomial can be 
recovered from its partial derivative vector fll], we get / b (x) 
from Afb(x). 

Step 3: Subtract the contribution of evaluations of / b (x) from 
codeword, and recover degree 1 polynomial £ b (x) = / b (x) — 
/ b (x) using the decoding algorithm for [N, m, 2sN+l] q code 
that generates Si. Output coefficients of / b (x) = £ b (x) + 
/ b (x) as the original data vector. 

B. Local Node Repair 

In this subsection, we explain that under the assumption 
similar to (0 on X, all symbols in C 1 have locality at most 3, 
i.e., each failed storage node can be recovered by contacting 

3 storage nodes. Since C 1 = iTZM. q {2,m))s x , we have 
(C I ) ± = (TZMf(2,m))\ s -. 

Following the reasoning used in Sec. UV-Bl with assumption 
that I adds at least one local parity for each point for 
each point p G S%, we can find a set of L + 3 points 
{p, p+^ih, .... p+t^ +2 h} that lie on a line in . We know 
from Lemma[TJand the discussion that follows the Lemma that 
for each set of 4 points on this line there exist a codeword 
of weight 4 in (C 1 ) 1 - = (TZMq(2,m))\ Sx supported on the 

4 points under consideration. Therefore, for each encoded 
symbol / b (p),p G Si, we can locally recover / b (p) by 
contacting a set of 3 nodes storing symbols associated with 

3 points in the aforementioned set of L + 3 points on a line 
defined by the pair (p, h). In fact, / b (p) can be recovered 
without knowing the dual codeword of weight 4 supported on 

4 points (including p) by applying polynomial interpolation 
based local decoding algorithm described in Sec. IIV-BI This 
establishes that C x is 3-local repairable. 

C. Code Parameters 

In this subsection, we study the rate and minimum distance 
of C 1 . For Tji large enough and under the assumption that 
[N, m, 2eN + l] q code satisfies requirements specified in the 
beginning of Sec. [V] it follows from Theorem 1.5 in JT) that 
algorithm Al recovers the data polynomial / b (x) from its 
evaluations on S2 even when fg|<S2 evaluations are incorrect. 
Therefore we have 

=.2 



Similar to Sec. IIV-CI we present rate of C 1 for two choices 
for 1: 

1) Case 1: Here we take 1 = e [N] 2 : i < j}. This 

ensures that each symbol has at least one set of L + 3 points to 
allow its local repair. Note that in this case, for some encoded 
symbols there are multiple line passing through these symbols 
in Si. For example, each symbol corresponding to a point in 
{2&i}f =1 can be repaired along N — 1 lines. In this case, we 
have that |5 Z |< [L + 1)N 2 . Therefore, 



rate(C 3 



(l-2 g ) s 
2L 



' 1 



(16) 



which can be considered as a good rate for locally repairable 
codes, when L is small. 

2) Case 2: Present definition of I and Si utilize the fact 
that {2a t , &i + aj,2aj} lie on a line. We may modify the 
set I to be a subset of [j^l] 2 such that G I only if 

1 < j. The set Si also needs to be modified accordingly. 
Let {pi}]f. 2 i be points of S2 in a prespecified order. Take 

2 = {(1, 2), . . . , (M + l), . . . , (|5 2 |-1, |5 2 |)} (assuming \S 2 \ 
is even). Take a family of sets 

Ci,j = p. t + t((q - l) P i + Pj ), 2<t<2 + L, 

Now, we generate a set Si as follows: 



Si = S 2 \J\ |J c id 



(17) 



With these choices of 1 and Si, each node has one set of 
L + 3 points to exploit for local repairablity, and it translates 



into rate of C being greater than 
rate is at least that in previous case. 



K 



AT2 + «i (L + 1) 



Note that this 
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